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Abstract 

In this paper we consider a class of nonparametric estimators of a 
distribution function F, with compact support, based on the theory of 
IFSs. The estimator of F is tought as the fixed point of a contractive 
operator T defined in terms of a vector of parameters p and a family of 
affine maps W which can be both depend of the sample (Xi, X2, . . . , X„). 
Given W, the problem consists in finding a vector p such that the fixed 
point of T is "sufficiently near" to F. It turns out that this is a quadratic 
constrained optimization problem that we propose to solve by penalization 
techniques. If F has a density /, we can also provide an estimator of / 
based on Fourier techniques. IFS estimators for F are asymptotically 
equivalent to the empirical distribution function (e.d.f.) estimator. We 
will study relative efficiency of the IFS estimators with respect to the e.d.f. 
for small samples via Monte Carlo approach. 

For well behaved distribution functions F and for a particular family 
of so-called wavelet maps the IFS estimators can be dramatically better 
than the e.d.f. (or the kernel estimator for density estimation) in presence 
of missing data, i.e. when it is only possibile to observe data on subsets 
of the whole support of F. 

This research has also produced a free package for the R statistical 
environment which is ready to be used in applications. 

key words: iterated function systems, distribution function estimation, 
nonparametric estimation, missing data, density estimation. 



1 Introduction 

Let Xi, X2, . . . , Xn be an i.i.d. sample drawn from a random variable X with 
unknown distribution function F with compact support The empirical 
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distribution function (c.d.f.) 

1 " 

F,,ix)^-y2x{x^<x) 

1=1 

is one commonly used estimator of the unknown distribution function F (here 
X is the indicator function). The e.d.f. has an impressive set of good statistical 
properties such as it is first order efficient in the minimax sense (see 0], 
^14 , 15 , and 8 ). More or less recently, other second order efficient estimators 
have been proposed in the literature for special classes of distribution functions 
F . Golubev and Levit (1996a, b) and [S] are two of such examples. It is rather 
curious that a step-wise function can be such a good estimator and, in fact, |S] 
shows that, for the class of analytic functions, for small sample sizes, the e.d.f. 
is not the best estimator. In this paper we study the properties of a new class 
of distribution function estimators based on iterated function systems (IFSs) 
introduced by the authors in a previous work jT2]. IFSs have been introduced 
in fTT| and T . The main idea on which this method is based consists of thinking 
the estimation of F as the fixed point of a contraction T on a complete metric 
space. The operator T is defined in terms of a family of affine maps W and a 
vector of parameters p. For a given family W, T depends only on the choice 
•p. The idea, known as inverse approach (see Section ^ is to determine p by 
solving a constrained quadratic optimization problem built in terms of sample 
moments. In this paper this optimization problem is solved by a penalization 
method. The nature of affine maps allow to derive easily the Fourier transform 
of F and, when available, an explicit formula for the density of F via anti Fourier 
transform. In this way, given W and p we have at the same time estimators for 
the distribution, characteristic and density functions. 

The paper is organized as follows. In Section |21 the inverse approach is 
presented and a penalization method is proposed in order to solve a quadratic 
optimization problem. We also discuss the choice of the family of maps W. In 
Section|3]numerical results and comparisons with classical estimators are shown 
for small samples via Monte Carlo Analysis. 

Finally we show an application of these estimators when the empirical dis- 
tribution function (or the kernel density estimator for the density) cannot be 
applied. We will consider situations of missing data when, for example, the 
data can only be observed on some windows of the support of F. This can be 
the case of directional data analysis when, for some reason, instruments are not 
able for technical or physical reason to collect data in same range of angles say 
A and B, A,B C [0, 27r]. For x in ^ or B the e.d.f. will be constant and, at the 
same time, the kernel density estimator will estimate a plurimodal distribution 
for these data. In this case we will show examples in which the IFS estimator 
does it job incredibly well. 

Tables and figures can be found at the end of the paper after the references. 
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2 An IFS estimator 



The theory of distribution function approximation via IFSs we wih use to derive 
estimators is due to jB). Resuhs from this section, apart from were exphcitly 
mentioned, are from the cited authors. Let A4{X) be the set of probabihty 
measures on B{X), the cr-algebra of Borel subsets of X where {X, d) is a compact 
metric space (in our case will he X — [a, P] and d the Euclidean metric.) 
In the IFSs literature the following Hutchinson metric plays a crucial role 

dnifJ-,}^)^ sup <^ / /d/i- / fdi^>, pL,v e M{X) 
/eLip(x) iJx Jx ) 



where 



Up{X) ^{f:X^R, \f{x) - f{y)\ < d{x, y), x,y € X} 



thus {M{X),dH) is a complete metric space [?, see]] hutch. 

We denote by (vif,p) an N-maps contractive IFS on X with probabihties 
or simply an N-maps IFS, that is, a set of N affine contraction maps, w = 

{Wi,W2, ■ . . ,WAr), 

Wi — Ui + bi x, with \bi\ < 1, 6^, e M, i — 1,2, ... , N, 

with associated probabilities p ~ {pi,p2, . . . ,Pn), Pi > 0, and X^iLiK ^ 1- The 
IFS has a contractivity factor defined as 

c — max IbA < 1 

l<i<Af 

Consider the following (usually called Markov) operator M : Ai{X) — > A4{X) 
defined as 



N 

Mp^^p^pow^^, peMiX), (1) 



where ^ is the inverse function of Wi and o stands for the composi- 
tion. In Hutchinson (1981) it was shown that M is a contraction mapping 
on {M{X),dH) i.e. for all p,^ e M{X), dH{Mp,Mv) < cdnip^v). Thus, 
there exists a unique measure p e fA{X), the invariant measure of the IFS, 
such that Mp = phy Banach theorem. Associated to each measure p e Ai{X), 
there exists a distribution function F. In terms of it the previous operator M 
can be rewritten as 

'0 if a; < a 

TV 

TF{x) = { J2 PtF{w-^{x)) ifa<x< P 
1=1 

1 ifx>f3 
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2.1 Minimization approach 

For affine IFSs there exists a simple and useful relation between the moments 
of probability measures on M{X). Given a iV-maps IFS(w,p) with associated 
Markov operator M, and given a measure fi £ M{X) then, for any continuous 
function f : X ^ R, 



N 

[ /(x)di.(x)= / /(x)d(M/.)(a;) / {f o w,){x)dfi{x) , (2) 

where = M^. In our case X = [a, (3] C M. so we readly have a relation 
involving the moments of /i and i^. Let 

gk= [ xM/i, hk= [ x'^di^, fc = 0,l,2,..., (3) 

JX JX 

be the moments of the two measures, with go ^ Hq = 1. Then, by with 
f{x) = x'', we have 

k ( N ^ 



Set X = [a, (j\ and let /i and /i'^-'^ G A^(X), j = 1, 2, . . . with associated moments 
of any order and 

= / x'^dM^^') . 

Then, the following statements are equivalent (as j — > oo and Vfc > 0): 

2. V/ e C(X), Jj^ fdfi^^^ J^fdfi, (weak* convergence), 

3. dHifl^'\^i)^0. 

(here C{X) is the space of continuous functions on X). This result gives 
a way to find and appropriate set of maps and probabilities by solving the so 
called problem of moment matching. With the solution in hands, given the 
convergence of the moments, we also have the convergence of the measures and 
then the stationary measure of M approximates with given precision (in a sense 
specified by the collage theorem below) the target measure fi [?, see]]bd. 

Next result, called the collage theorem is a standard product of the IFS 
theory and is a consequence of Banach theorem. 

(Collage theorem) : Let {Y, dy) be a complete metric space. Given an 
y (z Y, suppose that there exists a contractive map / on Y with contractivity 
factor < c < 1 such that dY{y,f{y)) < e. If y is the fixed point of /, i.e. 
/(y) = y, then dY{y,y) < j^- 
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So if one wishes to approximate a function y with the fixed point y of an 
unknown contractive map /, it is only needed to solve the inverse problem of 
finding / which minimizes the collage distance dY{y,fiy))- 

The main result in Forte and Vrscay that we will use to build one of the IFS 
estimators is that the inverse problem can be reduced to minimize a suitable 
quadratic form in terms of the pi given a set of affine maps Wi and the sequence 
of moments gk of the target measure. Let 

= |p (Pi,P2, ■ ■ • ,Pn) ■■ Pz > 0,^Pj = l| 

be the simplex of probabilities. Let w ~ {wi,W2, ■ ■ ■ ,wn), N — 1,2,... be 
subsets of W = {wi, W2, ■ ■ ■} the infinite set of affine contractive maps on X = 
[a,(3] and let g the set of the moments of any order of /i G Ai{X). Denote 
by M the Markov operator of the A^-maps IFS (w, p) and by = with 
associated moment vector of any order h^r. The collage distance between the 
moment vector of fi and vn 



A(p) = ||g-hjv||p :n 



N 



is a continuous function and attains an absolute minimum value Amin on 11 
where 



N 



i.2 



fc=i 



Moreover, A^j^^ ^ as ^ oo. Thus, the collage distance can be made 
arbitrarily small by choosing a suitable number of maps and probabilities. 

The above inverse problem can be posed as a quadratic programming one in 
the following notation 



5(p) = (A(p))2 = 



k=l 



D{X)^{g^igQ,g,,...):gk^ [ x^dfi, k ^ 0,1, . . . , e M{X)} 

Jx 

Then by Q there exists a linear operator A : D{X) D{X) associated to 
M such that hjy — Ag. In particular 

^ °° ^k\ 

hk = ^k^P^, k = l,2,... where A^, = ^ f . j &f < ^ g^ (4) 

i=l j=o ^-^^ 

Thus 

(Q) s(p) = p*gp + B*p + c, 

oo 

where Q = [g^], qi, = ^ — T2^> i,i = 1, 2, . . . , iV, 

k=l 
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oo oo 2 

^' = -2Efl^^- z=l,2,...,iV and C = J] || . (5) 
fc=i fe=i 
The series above are convergent as < A^i < 1 and the minimum can be found 
by minimizing the quadratic form on the simplex H^. 

The estimator wih then be buih by substituting the moments of the target 
measure with the empirical moments and by truncation of the above series to a 
finite sum. 

2.2 Numerical solutions 

When practical cases are considered, in particular concerning estimation, the 
previous series have to be truncated and this implies that the matrix Q is as- 
sured to be definite positive. Standard numerical procedures for the minimiza- 
tion of constrained quadratic optimization problems involving positive definite 
quadratic forms cannot be used in this context. To solve this problem an ap- 
proach is to build the following penalized function L\ 

Lxip) = p'Qp + B*p + C + A I 1 - ) 



i=l 



and then to study the following problem 



(LOP) minLA(p), < < I 



It is trivial that an optimizer p* of (LOP) such that J2iLi = 1 is also an 
optimizer for the problem 



(OP) min5(p), pen^ 

For solving (LOP) numerically, we have used the method L-BFGS-B due 
to |2j which allows to minimize a nonlinear function with box constraints, i.e. 
when each variable can be given a lower and/or upper bound. The initial value 
of this procedure must satisfy the constraints. This uses a limited-memory 
modification of the BFGS quasi-Newton method. The method '"BFGS"' is a 
quasi- Newton method (also known as a variable metric algorithm) . 

2.3 The choice of alRne maps 

As we are mostly concerned with estimation, we briefiy discuss the problem 
of choosing the maps. In 6 the following two sets of wavelet- type maps are 
proposed. Fixed and index i* G N, define 

7»i = TT. I = 1,2,..., I J = 1,2,..., 2 
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and 



X — a + {j — ~ a) 



i = 2,...,i* j = 2,...,i. 



I 



Then set = ^ 2* or = — l)/2 respectively. To choose the maps, 



consider the natural ordering of the maps and operate as follows 

Wl = {Wl =7ll,W2 =7l2,'i«3 =721,---,W6 = 724,W7 = 731 , . . . , Wat =7..2i*} 



respectively. In |12| we proposed the following quantile based maps 



where qi = F^^{ui), and = ui < M2 < . . . < un < um+i = 1 are 7V + 1 equally 
spaced points on [0, 1]. With these maps, it has been shown that, there is no 
need to use a moment matching approach. In particular, given pi — 1/N , the 
IFSs turns out to be a smoother of the e.d.f. and so it has nice small sample 
and asymptotic statistical properties (see cited reference) even for non compact 
support distribution functions F. Here we will also mix the quantile information 
with the moment matching idea. To distinguish the two cases (fixed pi = 1/N 
or p solution of (QP)) we will use the notation Qi and Q2 later on. 

2.4 Fourier analysis results 

We recall, from [7| results that are rather straight forward to prove but also 
essential to us since we will use them in density estimation and in particular in 
presence of missing data. Simplicity is due to affinity of the maps. We assume 
that the support of the measures is X = [0, 1] without loss of generality. 

Given a measure /x G A4(X), the Fourier transform (FT) </> : R — » C, where 
C is the complex space, is defined by the relation 



with the well known properties (/)(0) = 1 and \4>{t)\ < 1, G M. It can be 
shown that the space of characteristic functions !FT{X) can be made metric and 
complete with an opportune metric. Thus, given a TV-maps affine IFS(w,p) it 
is possibile to define a new linear operator B : !FT{X) !FT{X) whose unique 
fixed point reads as 



and 



Qi = {wi{x) = [qi+i - qi)x + gi, i = 1, 2, . . . , iV} 




N 
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This (j){t) is the FT of the fixed point of the iV-maps affine IFS(w, p). Now [?, 
see e.g.]]tarter, suppose that the target distribution F admits a density /. It is 
possible to write the density / via Fourier expansion. In fact, 



1 

itx ^ 







thus 



, +00 

/(x)-— V BkC^'^^ where = 0(fc) . 

ZTT — ^ 

A;— — 00 

3 Relative efficiency and estimation in presence 
of missing data 

Suppose to have an i.i.d. sample on n observations with common unknown dis- 
tribution function F with compact support on [a, f3] which has all the moments 
up to order M. An IFS estimator of F is the fixed point of the functional TF 
where the iV maps are choosen in advance and the pi are the solution of the 
(QP) quadratic programming problem where in the expression on Aik, Bi and 
C we replace, in equations (O and Q, the true moments gk with the sample 
moments m^, fc = 0, 1, . . . , M for a fixed M and we consider the first M terms 
of the series involved. 

Given the solution of (QP), we have an estimator for F and an estimator 
for the characteristic function of F, say 0. Suppose that F posseses a density 
/ then we have further a (Fourier) density estimator for / 



k— — 'rn 

1 1 f - - 1 

= — + - y^<^ Re{Bk) cos{kx) - lm{Bk) sm{kx) > 
71- ^ 1^ J 

where B^ = 4>{k) and m, the number of Fourier terms, is choosen in the usual 
way, i.e. 

2 ,29 

then use the first m coefiicents 







2 




2 


if 


Bm+1 


and 


Bm+2 


< 



n + 1 



[?, see again]]tarter. Tables ^ £^nd |21 show camparisons between the empirical 
cumulative distribution function and the IFS estimator, say T^r, for some 
target distributions F, in terms of average mean square error (AMSE) and 
sup-norm (SUP) distance. These tables contain Monte Carlo analysis where 
100 simulations have been done for each target distribution. Tables report the 
average ratio of the sup norm (and AMSE) of the IFSs over the corresponding 
sup norm (respectively AMSE) of the empirical distribution function. 
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It is possible to notice that the IFS estimator based on maps Wi has good 
properties for symmetric bell-shaped distributions and distributions with not so 
heavy tails (see also Figure It is also evident the asymptotic equivalence of 
the IFSs to the e.d.f. when quantile maps are used. Remark that, for Wi we 
have decided to use 62 maps, for W2 28 maps and n/2 quantiles for the quantiles 
maps Qi and Q2. So it is evident that for wavelet-type maps an adjustment 
can be done by choosing a suitable number of maps in terms of the sample size 
n. 

3.1 What if data are missing? 

Suppose now that the for some reason, the n sample observations from F are 
in fact a subset of a biggest sample, of unknown size. In practice we do not 
observe the data on the whole support of F [a, /3] but only on some windows. 
This sample reduction has happened due to some sort of censoring. So we are in 
presence of missing data when we do not know how many data are missing and 
where exactly they were missed, i.e. we are not in a classical censoring setup. 
A motivation for this scheme of (non)-observation is the following: suppose one 
wants to estimate the distribution of the angle of the wind registered by some 
instruments in degrees (0,360). For some reason, data from angles (15,37) and 
(62,79) are missing for technical failures or physical obstacles. In this case the 
empirical distribution function will be flat on these windows and a kernel density 
estimator will probably show a bimodal behaviour. 

Heuristically, this is due to the fact that quantile estimation is inappropriate 
in this context. At the same time, moments estimation tend to be more ro- 
bust, in particular if the distribution is symmetric. We only report a graphical 
example of what can happen. Figure^is about a sample from a Beta(2,2) dis- 
tribution when only the observation in (.1, .15) U (.37, .43) U (.7, .8) are available 
to the observer all the other being truncated by the instrument (we have choose 
this interval by hazard). The IFS estimator with Wi maps seems to be able to 
reconstruct the underlying distribution and density function, whistle, for obvi- 
ous reasons both the e.d.f. and the kernel estimators fail. In this example the 
relative efficiency (IFS/EDF) is 7% for the AMSE and 23% for the SUP-norm 
which is dramatically better than expected! 

3.2 Algorithm flow for estimation 

1. calculate sample moments 

2. choose the family of maps W 

3. build the quadratic form and solve it for p 

4. if you want to estimate F at point x: take any distribution function, for 
example the uniform over [a, /3] and start to iterate T 

5. stop after few iteration (normally 5 is enough) 
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6. the "fixed point" of T evaluated in x is the estimate of F{x) 

In case the support of F is not known one case use the range of the sample but 
the resulting IFS estimator will then try to approximate a distribution function 
which has exactly that support. If any hints on the shape of the distribution F 
is available, use it to choose the maps. 

All the examples, tables and graphics have been done by some software 
developed by the authors. In particular, a package called if s is freely available 
for the R environment system [13 in the CRAN (Comprehensive R Archive 
Network) http://cran.R-project.org under the contributed section. 

Conclusions 

It seems that this kind of approach can be used to make nonparametric inference 
when data are missing or sample size are small. Remark that with this method 
it is only possible to work with distributions with compact support. Moreover, 
a knowledge on the support itself it is needed. Neverthless, it seams a promising 
approach and the use of different sets of maps merits further investigation. 
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parameters 



AMSE 



SUP 



n 


law 


Wi 


W2 


Si 


C2 


Wi 


W2 


Si 


S2 


10 


bota(.9,.l) 


81 08 


77.05 


203.53 


149 68 


85.76 


75.44 


110.11 


110.81 


10 


bota(.l,.9) 


211.78 


2024.68 


203.39 


258.88 


175.32 


441.32 


114.51 


161.55 


10 


beta(.l,.l) 


118.27 


416.17 


182.88 


104.07 


114.87 


192.94 


119.57 


106.56 


10 


beta(2, 2) 


56.47 


80.53 


67.68 


112.46 


53.31 


69.24 


70.36 


98.21 


10 


beta( 5, 5) 


52.77 


57.90 


110.35 


152.29 


53.99 


54.83 


81.61 


125.67 


10 


beta( 3, 5) 


55.95 


71.07 


99.92 


142.52 


51.93 


60.58 


81.72 


116.79 


10 


beta( 5, 3) 


52.50 


57.34 


91.75 


131.37 


51.74 


52.47 


77.97 


109.84 


10 


beta( 1, 1) 


73.35 


119.04 


79.01 


102.04 


65.63 


90.40 


70.89 


90.85 


parameters 




AMSE 






SUP 




n 


law 


Wi 


Wa 


Qi 


Q2 


Wi 


W2 


Si 


S2 


20 


bcta(.9,.l) 


94.69 


85.25 


201.85 


169.78 


90.30 


79.92 


105.02 


123.28 


20 


bota(.l,.9) 


388.83 


4183.36 


203.70 


195.36 


257.13 


612.55 


109.10 


122.99 


20 


bota(.l,.l) 


154.1 


690.08 


125.35 


97.53 


139.65 


255.26 


103.56 


99.28 


20 


bota(2,2) 


61.46 


93.37 


85.46 


95.49 


55.34 


73.95 


84.42 


91.38 


20 


beta( 5, 5) 


54.31 


52.89 


105.84 


131.84 


53.76 


48.73 


85.85 


106.27 


20 


beta( 3, 5) 


60.42 


67.33 


93.30 


118.51 


55.98 


60.88 


85.39 


101.16 


20 


beta( 5, 3) 


53.82 


57.72 


92.26 


114.84 


53.46 


52.20 


85.23 


102.85 


20 


beta(l, 1) 


95.93 


89.79 


71.66 


154.54 


63.20 


106.95 


81.56 


82.54 


parameters 




AMSE 






SUP 




n 


law 


Wi 




Qi 


Q2 


Wi 


W2 


Si 


Q2 


30 


bcta(.9,.l) 


107.46 


90.27 


195.39 


143.00 


101.83 


81.05 


108.59 


109.85 


30 


beta(.l,.9) 


540.73 


6462.03 


190.82 


213.45 


107.80 


137.26 


314.53 


759.57 


30 


beta(.l,.l) 


112.66 


97.04 


233.50 


1342.44 


186.70 


356.91 


103.39 


99.98 


30 


bota(2,2) 


60.30 


92.92 


88.90 


96.88 


53.71 


72.06 


84.92 


89.11 


30 


bola( 5, 5) 


62.04 


56.07 


100.26 


121.41 


60.08 


51.82 


89.26 


100.16 


30 


bola( 3, 5) 


70.31 


76.90 


93.02 


108.76 


61.68 


66.29 


86.36 


95.24 


30 


bcta( 5, 3) 


55.78 


56.85 


92.10 


102.02 


55.56 


51.21 


88.20 


94.75 


30 


beta( 1, 1) 


71.88 


211.28 


94.36 


88.17 


63.15 


121.23 


83.74 


83.40 



Table 1: Relative efficiency of IFS estimators with different set of maps Wi , W2, 
Qi and Q2 with respect to the empirical distribution function (i.e. IFS/EDF). 
Based on 100 Monte Carlo simulation for each distribution. Small sample sizes. 
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parameters AMSE SUP 



n 


law 


Wi 


W2 


Qi 


Q2 


Wi 


W2 


Qi 


Q2 


50 


bota(.9..1) 


132.67 


115.10 


163.33 


129.24 


109.18 


88.77 


103.37 


101.90 


50 


bola(.1..9) 


1044.12 


12573.16 


181.99 


180.42 


421.49 


991.33 


104.37 


123.39 


50 


bola(.l,.l) 


306.49 


1917.23 


105.68 


97.27 


214.27 


430.63 


100.13 




98.04 


50 


bota( 2, 2) 


63.03 


106.56 


95.35 


95.66 


58.39 


80.00 


89.42 




89.36 


50 


bcta( 5, 5) 


68.94 


60.19 


102.22 


114.92 


66.77 


55.49 


91.86 




97.40 


50 


bcta( 3, 5) 


79.98 


93.80 


96.20 


102.32 


66.76 


77.57 


91.39 




93.76 


50 


beta( 5, 3) 


63.13 


62.21 


93.59 


98.47 


62.04 


55.95 


90.66 




93.19 


50 


beta( 1, 1) 


73.47 


304.41 


97.24 


92.19 


62.69 


150.39 


87.38 




86.30 


parameters 




AMSE 






SUP 






n 


law 


Wi 


Wa 


Qi 


C2 


Wi 


W2 


Qi 




C2 


100 


beta(.9,.l) 


195.54 


158.80 


140.55 


108.27 


138.93 


105.31 


102.25 


99.07 


100 


beta(.l,.9) 


1557.30 


20324.60 


135.45 


125.94 


536.84 


1267.81 


103.87 


106.05 


100 


beta(.l,.l) 


554.11 


3918.62 


102.67 


98.29 


304.59 


625.75 


99.10 




98.04 


100 


bcta(2, 2) 


61.63 


165.60 


95.58 


97.46 


57.18 


98.50 


92.11 




93.09 


100 


bcta(5, 5) 


87.97 


67.79 


99.28 


108.21 


78.94 


60.96 


94.83 




96.52 


100 


bcta(3, 5) 


111.30 


134.54 


100.68 


103.31 


79.59 


100.20 


95.35 




95.72 


100 


beta( 5, 3) 


61.03 


57.19 


97.28 


101.32 


65.97 


55.08 


94.14 




95.42 


100 


beta( 1, 1) 


67.91 


558.50 


97.71 


94.87 


58.71 


201.10 


90.83 




89.97 


parameters 




AMSE 






SUP 






n 


law 


Wi 


Wa 


Ci 


Q2 


Wi 


W2 


Qi 




S2 


250 


bcta(.9,.l) 


338.72 


255.23 


115.25 


101.55 


180.29 


131.97 


100.68 


99.43 


250 


bcta(.l,.9) 


3979.61 


50448.13 


117.81 


105.37 


874.65 


2045.15 


100.82 


99.73 


250 


bota(.l,.l) 


1345.72 


10051.20 


100.60 


98.97 


480.12 


977.30 


99.16 




98.73 


250 


bota(2,2) 


79.01 


275.93 


98.59 


98.30 


67.14 


132.87 


95.50 




95.24 


250 


beta( 5, 5) 


163.68 


99.35 


99.07 


100.54 


111.38 


78.48 


96.40 




96.83 


250 


beta( 3, 5) 


212.17 


228.58 


99.45 


99.69 


113.70 


142.21 


96.57 




96.32 


250 


beta( 5, 3) 


91.32 


73.31 


99.05 


99.20 


88.87 


67.13 


96.84 




97.24 


250 


beta( 1, 1) 


69.03 


1165.61 


99.47 


98.46 


61.07 


293.58 


94.88 




94.55 



Table 2: Relative efficiency of IFS estimators with different set of maps Wi , W2, 
Qi and Q2 with respect to the empirical distribution function (i.e. IFS/EDF). 
Based on 100 Monte Carlo simulation for each distribution. Moderate to big 
sample sizes. 
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IFS (red) vs EDF (blue) 




Figure 1: Data from a Beta(2,2) distribution when only the observation in 
(.1, .15) U (.37, .43) U (.7, .8) are available to the observer all the other being 
truncated by the instrument. The observations are marked as vertical ticks. 
The IFS estimator with Wi maps seems to be able to reconstruct the underlying 
distribution and density function, whistle, for obvious reasons both the edf and 
the kernel estimators fail. Notice that the arbitrary choice of the window of 
observation can be changed without substantial loss or gain. In this example 
the relative efficiency (IFS/EDF) is 7% for the AMSE and 23% for the SUP- 
norm. 
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IFSs vs EDF, beta(2,2) 




Figure 2: Relative efficiency of IFS estimator for different set of maps Wi, W2, 
Qi and Q2 witfi respect to tlie empirical distribution function. Based on 100 
Monte Carlo simulations. SUP-norm up, AMSE bottom. 
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IFSs vs EDF, beta(5,3) 




Figure 3: Relative efficiency of IFS estimator for different set of maps Wi, W2, 
Qi and Q2 witfi respect to tlie empirical distribution function. Based on 100 
Monte Carlo simulations. SUP-norm up, AMSE bottom. 
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IFSsvsEDF,beta(.1,.9) 




Figure 4: Relative efficiency of IFS estimator for different set of maps Wi, W2, 
Qi and Q2 with respect to the empirical distribution function. Based on 100 
Monte Carlo simulations. SUP-norm up, AMSE bottom. 
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